Server and method of controlling the same

ABSTRACT

A server which interacts with a display apparatus is provided. The server includes a storage unit configured to store conversation patterns for each service domain, a communication unit configured to receive a user's voice from the display apparatus, and a control unit configured to determine a service domain including the user's voice, generate response information corresponding to the user's voice based on a conversation pattern of the determined service domain, and control the communication unit to transmit the response information to the display apparatus. When it is determined that a currently received user's voice is included in another service domain which is different from a service domain including a previously received user's voice, the control unit generates the response information corresponding to the currently received user's voice based on a conversation pattern of the other service domain.

CROSS-REFERENCE TO RELATED APPLICATION

This application claims priority from Korean Patent Application No. 10-2012-0064505, filed on Jun. 15, 2012, in the Korean Intellectual Property Office, the disclosure of which is incorporated herein by reference in its entirety.

BACKGROUND

1. Field

Apparatuses and methods consistent with exemplary embodiments relate to a server and a method of controlling the same, and more particularly, to a server which controls a display apparatus capable of interacting with a user, and a method of controlling the same.

2. Description of the Related Art

With the development of electronic technology, various kinds of display apparatuses have been developed and have become widespread, and they include various functions according to user demand. In particular, in recent years, the television (TV) has been connected to the Internet and supports Internet services, and the user can watch a large number of digital broadcasting channels through the TV.

In recent years, technology using voice recognition has been developed to control the display apparatus more easily and intuitively. In particular, the TV can recognize the voice of the user and perform a function corresponding to the voice of the user, such as volume adjustment or channel change.

However, although the display apparatus in the related art which recognizes the user's voice provides the function corresponding to the recognized voice, it does not provide interactive information through conversation with the user.

SUMMARY

One or more exemplary embodiments may overcome the above disadvantages and other disadvantages not described above. However, it is understood that one or more exemplary embodiments are not required to overcome the disadvantages described above, and may not overcome any of the problems described above.

One or more exemplary embodiments provide a server capable of controlling a display apparatus which interacts with a user by considering a service domain including the user's voice, and a method of controlling the same.

According to an aspect of an exemplary embodiment, there is provided a server which interacts with a display apparatus. The server may include: a storage which stores conversation patterns for service domains; a communicator configured to receive a voice expression from the display apparatus; and a controller configured to determine a service domain related to the voice expression, generate response information corresponding to the voice expression based on a conversation pattern of the determined service domain, and control the communicator to transmit the response information to the display apparatus. When it is determined that a currently received voice expression is included in another service domain which is different from a service domain including a previously received voice expression, the controller generates the response information corresponding to the currently received voice expression based on a conversation pattern of the other service domain.

When it is determined that the currently received voice expression is included in the other service domain even before slots constituting the service domain are filled with the previously received voice expression, the controller generates the response information corresponding to the currently received voice expression based on the conversation pattern of the other service domain.

The controller determines an utterance intention included in the currently received voice expression, generates the response information corresponding to the determined utterance intention based on the conversation pattern of the other service domain, and controls the communicator to transmit the response information to the display apparatus.

The response information comprises response message information corresponding to the voice expression, and the display apparatus outputs a response message corresponding to the voice expression based on the response message information.

The response information further comprises a control command for controlling a function of the display apparatus.

According to another aspect of an exemplary embodiment, there is provided a method of controlling a server which interacts with a display apparatus. The method may include: receiving a voice expression from the display apparatus; determining a service domain including the voice expression, using conversation patterns; generating response information corresponding to a currently received voice expression based on a conversation pattern of another service domain when the currently received voice expression is included in the other service domain which is different from a service domain including a previously received voice expression; and transmitting the generated response information to the display apparatus.

The generating comprises generating the response information corresponding to the currently received voice expression based on the conversation pattern of the other service domain when it is determined that the currently received voice expression is included in the other service domain before slots constituting the service domain are filled with the previously received voice expression.

Alternatively, the generating comprises determining an utterance intention included in the voice expression and generating the response information corresponding to the determined utterance intention based on the conversation pattern of the other service domain.

The response information comprises response message information corresponding to the voice expression, and the display apparatus outputs a response message corresponding to the voice expression based on the response message information.

The response information further comprises a control command for controlling a function of the display apparatus.

As described above, according to the various exemplary embodiments, even when the service domain including a voice expression changes, it is possible to output a response message corresponding to the voice expression based on the changed service domain. Therefore, the user and the display apparatus can interact without interruption regardless of the service domain.

The above-described voice expression may be a voice expression of a user.

According to another aspect of an exemplary embodiment, there is provided a server which interacts with a display apparatus. The server may comprise: a storage which stores conversation patterns for service domains; a communicator configured to receive a voice expression from the display apparatus; and a controller configured to determine a service domain among a plurality of service domains, each of said plurality of service domains being related to a corresponding conversation pattern, generate response information corresponding to the voice expression based on a conversation pattern of the determined service domain, and control the communicator to transmit the response information to the display apparatus.

The controller may determine an utterance intention included in the received voice expression, generate the response information corresponding to the determined utterance intention based on the conversation pattern, and control the communicator to transmit the response information to the display apparatus.

According to yet another exemplary embodiment, there is provided a method of controlling a server which interacts with a display apparatus, the method comprising: receiving a voice expression from the display apparatus; determining a service domain related to the voice expression among a plurality of service domains, using conversation patterns, each of the plurality of service domains being related to corresponding conversation patterns; generating response information corresponding to the voice expression based on a conversation pattern of the determined service domain; and transmitting the generated response information to the display apparatus.

Additional aspects of the exemplary embodiments will be set forth in the detailed description, will be obvious from the detailed description, or may be learned by practicing the exemplary embodiments.

BRIEF DESCRIPTION OF THE DRAWING FIGURES

The above and/or other aspects will be more apparent by describing in detail exemplary embodiments, with reference to the accompanying drawings, in which:

FIG. 1 is a view illustrating an interactive system according to an exemplary embodiment;

FIG. 2 is a block diagram illustrating a configuration of a display apparatus according to an exemplary embodiment;

FIG. 3 is a block diagram illustrating a detailed configuration of the display apparatus illustrated in FIG. 2;

FIG. 4 is a block diagram illustrating a configuration of a server according to an exemplary embodiment;

FIGS. 5A and 5B are views illustrating an operation of an interactive system according to an exemplary embodiment; and

FIG. 6 is a flowchart illustrating a method of controlling a server according to an exemplary embodiment.

DETAILED DESCRIPTION OF THE EXEMPLARY EMBODIMENTS

Hereinafter, exemplary embodiments will be described in more detail with reference to the accompanying drawings.

In the following description, the same reference numerals are used for the same elements when they are depicted in different drawings. The matters defined in the description, such as detailed construction and elements, are provided to assist in a comprehensive understanding of the exemplary embodiments. Thus, it is apparent that the exemplary embodiments can be carried out without those specifically defined matters. Also, functions or elements known in the related art are not described in detail since they would obscure the exemplary embodiments with unnecessary detail.

FIG. 1 is a view illustrating an interactive system according to an exemplary embodiment. As shown in FIG. 1, the interactive system includes a display apparatus 100 and a server 200. Although the display apparatus 100 may be a smart TV as shown in FIG. 1, this is merely illustrative. The display apparatus 100 may be implemented with various electronic devices, for example, a mobile phone such as a smart phone, a desktop personal computer (PC), a laptop PC, a navigation system, or the like.

Herein, the display apparatus 100 may be controlled by a remote controller (not shown). For example, if the display apparatus 100 is implemented with a television, operations such as power on/off, channel change, and volume adjustment may be performed according to a control signal received from the remote controller (not shown).

The display apparatus 100 may transmit a collected user's voice to the server 200, and the server 200 may generate response information corresponding to the user's voice received from the display apparatus 100 and transmit the generated response information to the display apparatus 100. A user's “voice” may include voice data, a voiced statement of a user, a voiced question of a user, a voiced sound of a user, a voiced expression, or the like.

Specifically, the server 200 may determine a service domain including the user's voice, and generate the response information corresponding to the user's voice based on a conversation pattern of the determined service domain. For example, when the voiced expression, “What is the name of the program being broadcasted currently?”, is collected, the server 200 may determine that the received voice is included in a broadcasting service domain, determine a response corresponding to the received voice in a conversation pattern stored for the broadcasting service domain, and transmit the response to the display apparatus 100.

The display apparatus 100 may perform various operations based on the response information received from the server 200. In the above-described example, the display apparatus 100 may output a message “The inquired title of the program is ∘∘∘” as the response to the voiced expression, “What is the name of the program being broadcasted currently?”, in a voice form, a text form, or a combination thereof, on the basis of the response information.

Even when the service domain including the currently collected user's voice is different from a service domain including a previously collected user's voice, the display apparatus 100 may output the response message corresponding to the currently collected user's voice or perform the function corresponding to the user's voice.

That is, when it is determined that the service domain including the user's voice currently received from the display apparatus 100 is different from the service domain including the previously received user's voice, the server 200 may generate the response information corresponding to the currently received user's voice using a conversation pattern of the other service domain and transmit the response information to the display apparatus 100.

In the above-described example, it is assumed that the voiced expression, “How's the weather today?”, is received from the display apparatus 100 after the voiced expression, “What is the name of the program being broadcasted currently?”, is received. In this case, when it is determined that the service domain including the currently received user's voice is a weather service domain, the server 200 detects a response corresponding to the currently received user's voice in a conversation pattern stored for the weather service domain, and transmits the detected response “Let's know the desired area.” to the display apparatus 100.

As described above, the interactive system according to the exemplary embodiment can provide a continuous response to the user even when the voice currently uttered by the user is included in another service domain which is different from the service domain including the previously uttered voice.
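The domain determination and response lookup described above can be pictured with the following minimal sketch. It is illustrative only: the keyword lists, the dictionary layout, the placeholder reply text, and the names `CONVERSATION_PATTERNS`, `determine_service_domain`, and `Server` are assumptions introduced here, since the description does not specify how conversation patterns are stored or matched.

```python
# Hypothetical conversation patterns, keyed by service domain.
# Keywords and storage layout are assumptions made for illustration.
CONVERSATION_PATTERNS = {
    "broadcasting": {
        "keywords": ("program", "channel", "broadcast"),
        "response": "The inquired title of the program is <title>",
    },
    "weather": {
        "keywords": ("weather", "temperature", "rain"),
        "response": "Let's know the desired area.",
    },
}


def determine_service_domain(utterance):
    """Return the first domain whose keywords occur in the utterance, else None."""
    text = utterance.lower()
    for domain, pattern in CONVERSATION_PATTERNS.items():
        if any(keyword in text for keyword in pattern["keywords"]):
            return domain
    return None


class Server:
    """Remembers the previous domain and answers from the current one."""

    def __init__(self):
        self.previous_domain = None

    def handle(self, utterance):
        domain = determine_service_domain(utterance)
        if domain is None:
            return {"domain": None, "domain_changed": False, "message": None}
        changed = (
            self.previous_domain is not None and domain != self.previous_domain
        )
        self.previous_domain = domain
        # The reply always comes from the pattern of the *current* domain,
        # even when it differs from the domain of the previous utterance.
        return {
            "domain": domain,
            "domain_changed": changed,
            "message": CONVERSATION_PATTERNS[domain]["response"],
        }
```

In this sketch, a weather question that follows a broadcasting question is simply answered from the weather pattern, and the `domain_changed` flag records that the service domain differs from the previous one.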

FIG. 2 is a block diagram illustrating a configuration of a display apparatus according to an exemplary embodiment. Referring to FIG. 2, the display apparatus 100 includes an output unit 110, a voice collection unit 120, a communication unit 130, and a control unit 140.

The output unit 110 outputs at least one of video and audio. Specifically, the output unit 110 may output the response message corresponding to the user's voice collected through the voice collection unit 120 in at least one of a voice form and a text form.

To this end, the output unit 110 may include a display unit (not shown) and an audio output unit (not shown).

Specifically, the display unit (not shown) may be implemented with a liquid crystal display (LCD), an organic light emitting display (OLED), or a plasma display panel (PDP) and provide various display screens which can be provided through the display apparatus 100. In particular, the display unit (not shown) may display the response message corresponding to the user's voice in a text form or an image form.

Here, the display unit (not shown) may be implemented as a touch screen in which the display unit and a touch pad form a mutual layer structure, and the touch screen may be configured to detect a location, an area, and a pressure of a touch input.

The audio output unit (not shown) may be implemented with an output port, such as a jack, or with a speaker, and may output the response message corresponding to the user's voice in a voice form.

Meanwhile, the output unit 110 may output various video and audio. Herein, the video and audio may be video and audio constituting broadcasting contents, multi-media contents, and so on.

The voice collection unit 120 collects the user's voice. For example, the voice collection unit 120 may be implemented with a microphone configured to collect the user's voice. The voice collection unit 120 may be implemented integrally with the display apparatus 100 or separately from the display apparatus 100. When the voice collection unit 120 is implemented separately from the display apparatus 100, the voice collection unit 120 may be implemented as a hand-held type or a type which can be placed on a dining table or another table. The voice collection unit 120 may be connected to the display apparatus 100 through a wired or wireless network and transmit the collected user's voice to the display apparatus 100.

In addition, the voice collection unit 120 may determine whether or not the collected sound is a voice uttered by a user, and filter noise (for example, air conditioning sound, cleaning sound, music sound, and the like) from the voice.

For example, when the user's voice is input as an analog signal, the voice collection unit 120 samples the analog signal and converts the user's voice into a digital signal. The voice collection unit 120 calculates the energy of the converted digital signal and determines whether or not the energy of the digital signal is equal to or larger than a preset value.

When it is determined that the energy of the digital signal is equal to or larger than the preset value, the voice collection unit 120 removes a noise component from the digital signal and transmits the noise-removed voice. The noise component is a sudden noise which can occur in the home environment, such as air conditioning sound, cleaning sound, or music sound. When it is determined that the energy of the digital signal is less than the preset value, the voice collection unit 120 performs no processing on the digital signal and waits for another input. Accordingly, the whole audio processing procedure is not activated by sounds other than the user's uttered voice, so that unnecessary power consumption can be prevented.
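The energy check described above can be sketched as follows. This is a simplified illustration under stated assumptions: the mean-square energy measure, the threshold values, and the simple amplitude gate standing in for noise removal are choices made here, and the function names are hypothetical; the description does not specify the actual energy calculation or noise-removal method.

```python
def signal_energy(samples):
    """Mean-square energy of a frame of digital samples (assumed measure)."""
    if not samples:
        return 0.0
    return sum(s * s for s in samples) / len(samples)


def remove_noise(samples, noise_floor):
    """Stand-in for noise removal: zero out samples below an amplitude floor."""
    return [s if abs(s) >= noise_floor else 0 for s in samples]


def process_frame(samples, energy_threshold=1000.0, noise_floor=10):
    """Return the noise-removed frame, or None when the energy is below the
    preset value, in which case no further processing is performed."""
    if signal_energy(samples) < energy_threshold:
        return None  # wait for another input
    return remove_noise(samples, noise_floor)
```

Returning `None` for low-energy frames mirrors the stated goal of not activating the rest of the audio processing procedure for sounds other than the user's uttered voice.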

The communication unit 130 performs communication with the server 200 (see FIG. 1). Specifically, the communication unit 130 may transmit the user's voice to the server 200 and receive the response information corresponding to the user's voice from the server 200.

To this end, the communication unit 130 may perform communication with the server 200 using various communication methods.

For example, the communication unit 130 may perform communication with the server 200 using wired/wireless LAN (Local Area Network), WAN, Ethernet, Bluetooth, Zigbee, USB (Universal Serial Bus), IEEE 1394, Wi-Fi, and so on. To do so, the communication unit 130 may comprise a chip or an input port and the like corresponding to each communication method. For example, when communication is performed based on a wired LAN method, the communication unit 130 may comprise a wired LAN card (not shown) and an input port (not shown).

The control unit 140 controls an overall operation of the display apparatus 100. Specifically, the control unit 140 may control the voice collection unit 120 to collect the user's voice and control the communication unit 130 to transmit the collected user's voice to the server 200. In addition, the control unit 140 may control the communication unit 130 to receive the response information corresponding to the user's voice from the server 200.

The control unit 140 may control the output unit 110 to output a response message corresponding to the user's voice based on the response information received from the server 200.

Here, the response information may include response message information for outputting the response message. The response message information is information in which the response message corresponding to the user's voice is represented in a text form, and the control unit 140 may output the response message corresponding to the user's voice in at least one of a voice form and a text form through the output unit 110, based on the response message information.

Specifically, the control unit 140 may convert the text-type response message information into voice using a Text to Speech (TTS) engine and output the voice through the output unit 110. Here, the TTS engine is a module configured to convert text into voice, and may do so using any of various known TTS algorithms. Further, the control unit 140 may configure a user interface (UI) screen including the text constituting the response message information and output the UI screen through the output unit 110.

For example, when the display apparatus 100 which is implemented with a television collects the user's voice “when does the program start?”, the server 200 may transmit “which program do you mean?” in the text form to the display apparatus 100. In this case, the control unit 140 may convert “which program do you mean?” into a voice and output the voice through the output unit 110, or may compose and output a screen including the text of “which program do you mean?” through the output unit 110.

As described above, the control unit 140 may output the response message corresponding to the user's voice without performing any function in the display apparatus 100 when the user's voice requests execution of a function which cannot be executed in the display apparatus 100, or includes a question which requires an answer.

For example, when the display apparatus 100 is implemented with a smart TV and “Call XXX” is input as the user's voice, but a video call function is not supported, the control unit 140 may perform no processing, and output the response message “the function is not supported” in at least one of a text form and a voice form through the output unit 110, based on the response message information. Further, when the display apparatus 100 is implemented with a smart TV and “Let's know the name of the most popular program” is input as the user's voice, the control unit 140 may perform no processing and output the response message “The most popular program is ∘∘∘ (the name of the program)” in at least one of a text form and a voice form through the output unit 110, based on the response message information.

The response information may further include a control command for controlling the function of the display apparatus 100. Thus, the control unit 140 may control the respective components of the display apparatus 100 to execute a specific function executable in the display apparatus 100 according to the type of the display apparatus 100, based on the control command received from the server 200.

For example, when the display apparatus 100 is implemented with a television and “Turn up the volume” is collected as the user's voice, the server 200 may transmit a control command to turn up the volume of the display apparatus 100 to the display apparatus 100. In this case, the control unit 140 may increase the audio volume output through the output unit 110 based on the control command. However, this is only an example; the control unit 140 may control each component of the display apparatus 100 so that various operations such as power on/off, channel change, and volume adjustment can be performed according to the collected user's voice.

In addition, the response information may include a control command for controlling the function of the display apparatus 100 and response message information related to the specific function executed according to the control command. In this case, the control unit 140 may perform the function according to the control command and output the response message related to the control command in at least one of a text form and a voice form through the output unit 110.

For example, when the user's voice includes contents for executing a function executable in the display apparatus 100, the control unit 140 may perform the function intended by the user according to the control command received from the server 200, and output information related to the executed function in at least one of a text form and a voice form, based on the response message information received from the server 200. For example, when the display apparatus 100 is implemented with a smart TV, and “Change the channel to No. 11” is expressed as the user's voice, the control unit 140 may select the channel No. 11 according to the control command for changing the channel to No. 11, and output the response message “The channel has been changed to No. 11” or “Change in the channel has been completed” in at least one of a text form and a voice form through the output unit 110, based on the response message information.
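One way to picture how the display apparatus might act on response information that carries a control command, response message information, or both is sketched below. The dictionary layout, the command names `change_channel` and `volume_up`, and the `DisplayApparatus` class are hypothetical and introduced only for illustration; the description does not define a wire format for the response information.

```python
class DisplayApparatus:
    """Toy model of the control unit acting on received response information."""

    def __init__(self):
        self.channel = 1
        self.volume = 10
        self.outputs = []  # response messages that would be shown or spoken

    def apply_response(self, response_information):
        # Execute the control command first, if one is present (assumed order).
        command = response_information.get("control_command")
        if command is not None:
            if command["name"] == "change_channel":
                self.channel = command["value"]
            elif command["name"] == "volume_up":
                self.volume += command.get("value", 1)
        # Then output the response message, if one is present.
        message = response_information.get("message")
        if message:
            self.outputs.append(message)


tv = DisplayApparatus()
tv.apply_response({
    "control_command": {"name": "change_channel", "value": 11},
    "message": "The channel has been changed to No. 11",
})
```

Response information containing only a message (for example, “the function is not supported”) leaves the channel and volume untouched in this sketch and only appends to `outputs`.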

In addition, even when the service domain including the currently collected user's voice is different from the service domain including the previously collected user's voice, the control unit 140 may output the response message corresponding to the currently collected user's voice or perform the function corresponding to the user's voice.

For example, it is assumed that the display apparatus 100 outputs the response message “The inquired program is ∘∘∘ (the name of the program)” based on the response information received from the server 200 with respect to the previously collected user's voice “what is the name of the program being broadcasted currently?”. Next, when the user's voice expression “How is the weather today?”, which is included in another service domain different from the service domain including the previously collected user's voice, is collected, the control unit 140 may output the response message “Let's know the desired area” in at least one of a voice form and a text form through the output unit 110, based on the response information received from the server 200.

That is, even when the slots provided in a specific service domain have not been completely filled by the user's voice, if a user's voice included in another service domain is received next, the control unit 140 may output the response message corresponding to the next received user's voice.

To this end, when the user's voice included in the other service domain is received, the server 200 may generate the response information corresponding to the user's voice based on a conversation pattern of the other service domain, and transmit the response information to the display apparatus 100. The above-described operation will be described later with reference to FIG. 4.
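The slot behavior described above can be illustrated with a small dialogue-state sketch. The slot names in `DOMAIN_SLOTS`, the `DialogueState` class, and the choice to discard the unfilled slots of the earlier domain on a switch are all assumptions made for illustration; the description states only that a response is generated from the other service domain even before the slots of the earlier domain are filled.

```python
# Hypothetical slots required by each service domain.
DOMAIN_SLOTS = {
    "broadcasting": ("program_name",),
    "weather": ("area", "date"),
}


class DialogueState:
    """Tracks the active service domain and the slots filled so far."""

    def __init__(self):
        self.domain = None
        self.slots = {}

    def update(self, domain, filled):
        """Record newly filled slots and return the names still missing."""
        if domain != self.domain:
            # Switch domains immediately, even if the previous domain's
            # slots were never completed (discarding them is an assumption).
            self.domain = domain
            self.slots = {}
        self.slots.update(filled)
        return [name for name in DOMAIN_SLOTS[domain] if name not in self.slots]
```

For the weather example, an utterance that fills only a date slot would leave `area` missing, which corresponds to the server replying “Let's know the desired area”.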

Although the above-described exemplary embodiment has illustrated the response message information transmitted from the server 200 as representing the response message in a text form, this is merely illustrative. That is, the response message information may be the voice data itself constituting the response message output from the display apparatus 100, or a portion of the voice data constituting the corresponding response message. Alternatively, the response message information may have a form of a control signal for outputting the corresponding response message using the voice or text pre-stored in the display apparatus 100.

Therefore, the control unit 140 may output the response message by considering the type of the response message information. Specifically, when the voice or text data itself constituting the response message is received, the control unit 140 may process the corresponding data into a form which can be output by the output unit 110 and output the processed data.

When the control signal for outputting the response message is received, the control unit 140 may search for data which matches the control signal among the data pre-stored in the display apparatus 100, process the found voice or text data into a form which can be output, and output the processed data through the output unit 110. To this end, the display apparatus 100 may store, in advance, voice or text data for providing a response message related to function execution, or voice or text data related to a request for information. For example, the display apparatus 100 may store data in a form of a complete sentence such as “Change in the channel is completed” or partial data constituting a sentence such as “The channel is changed to No. xx”. In this case, the name of the channel which completes the corresponding sentence may be received from the server 200.
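The lookup of pre-stored message data by control signal might be sketched as follows. The control signal identifiers, the `STORED_MESSAGES` table, and the `build_response_message` helper are hypothetical names introduced for illustration, and only text data is modeled; the description does not specify how control signals are encoded or how stored data is indexed.

```python
# Hypothetical text data pre-stored in the display apparatus.
# One entry is a complete sentence; the other is partial data that is
# completed with a value received from the server.
STORED_MESSAGES = {
    "CHANNEL_CHANGE_DONE": "Change in the channel is completed",
    "CHANNEL_CHANGED_TO": "The channel is changed to No. {channel}",
}


def build_response_message(control_signal, **fields):
    """Find the stored text matching the control signal and complete it.

    Returns None when no stored data matches the control signal.
    """
    template = STORED_MESSAGES.get(control_signal)
    if template is None:
        return None
    return template.format(**fields)
```

With this layout the server would need to send only the control signal and, for partial sentences, the missing channel value, rather than the full message text.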

FIG. 3 is a block diagram illustrating a detailed configuration of the display apparatus illustrated in FIG. 2. Referring to FIG. 3, the display apparatus 100 may further include an input unit 150, a storage unit 160, a reception unit 170, and a signal processing unit 180 in addition to the components illustrated in FIG. 2. The components illustrated in FIG. 3 which overlap the components illustrated in FIG. 2 have the same functions, and thus detailed description thereof will be omitted.

The input unit 150 is an input means configured to receive a user's various operations and transmit the received operations to the control unit 140, and may be implemented with an input panel. Here, the input panel may be a touch pad type, a key pad type including various function keys, number keys, specific keys, character keys, and the like, or a touch screen type. In addition, the input unit 150 may be implemented with an infrared (IR) reception unit (not shown) configured to receive a remote control signal transmitted from a remote controller for controlling the display apparatus 100.

The input unit 150 may receive a user's various operations for controlling the function of the display apparatus 100. For example, when the display apparatus 100 is implemented with a smart TV, the input unit 150 may receive the user's operation for controlling the function of the smart TV such as power on/off, channel change, and volume adjustment. In this case, the control unit 140 may control the other components to perform various functions corresponding to the user's operation input from the input unit 150. For example, when the power-off command is received, the control unit 140 may interrupt the power supplied to the respective components of the display apparatus 100. When the channel-change command is received, the control unit 140 may control the reception unit 170 to select the channel selected by the user's operation.

In particular, the input unit 150 receives the user's operation to start a voice recognition mode for collecting the user's voice. For example, the input unit 150 may be implemented as a touch screen together with the display unit, and may display an object (for example, an icon) for starting the voice recognition mode. Alternatively, the input unit 150 may include a separate button for starting the voice recognition mode. When the user's operation to start the voice recognition mode is received through the input unit 150, the control unit 140 may activate the voice collection unit 120 and collect the user's voice uttered within a preset range. Then, the control unit 140 may receive, through communication with the server 200, the response information corresponding to the collected user's voice, and output the response message or execute the specific function.

The storage unit 160 is a storage medium configured to store various programs necessary for operating the display apparatus 100, and may be implemented with a memory, a hard disk drive (HDD), and the like. For example, the storage unit 160 may include a read only memory (ROM) configured to store a program for executing an operation of the control unit 140, a random access memory (RAM) configured to temporarily store data according to execution of the operation of the control unit 140, and the like. The storage unit 160 may further include an electrically erasable and programmable ROM (EEPROM) configured to store various reference data, and the like.

In particular, the storage unit 160 may have previously stored the various response messages corresponding to the user's voice in a voice or text form. Therefore, the control unit 140 may read at least one of the voice and text data corresponding to the response message information (specifically, the control signal) received from the server 200, and output the read data to an audio output unit 112 or a display unit 111. In this case, the control unit 140 may perform signal processing such as decoding on the voice data, amplify the decoded voice data, and output the amplified voice data to the audio output unit 112. Alternatively, the control unit 140 may configure a UI screen including the text constituting the text data, and output the UI screen through the display unit 111. Although the above-described exemplary embodiment has illustrated that the control unit 140 performs the signal processing for the voice and text data read from the storage unit 160, the control unit 140 may control the signal processing unit 180 to perform the signal processing for the voice and text data.

The reception unit 170 receives various contents. Specifically, the reception unit 170 receives the contents from a broadcasting station configured to transmit broadcasting program contents using a broadcasting network, or from a web server configured to transmit content files using the Internet. Further, the reception unit 170 may receive the contents from various recording medium reproducing apparatuses provided in the display apparatus 100 or connected to the display apparatus 100. The recording medium reproducing apparatus is an apparatus configured to reproduce the contents stored in various kinds of recording media such as a compact disc (CD), a digital versatile disc (DVD), a Blu-ray disc, a memory card, or a universal serial bus (USB) memory.

In an exemplary embodiment which receives contents from a broadcastingstation, the reception unit 170 may be implemented to include aconfiguration such as a tuner, a demodulator, an equalizer, and the likewhich are not illustrated in FIG. 3. In an exemplary embodiment whichreceives contents from a source such as the web server, the receptionunit 170 may be implemented with a network interface card (not shown).Alternatively, in an exemplary embodiment which receives contents fromthe above-described various recording medium reproducing apparatuses,the reception unit 170 may be implemented with an interface unit (notshown) connected to the recording medium reproducing apparatus. Asdescribed above, the reception unit 170 may be implemented with varioustypes according to the exemplary embodiments.

The signal processing unit 180 performs signal processing for thecontents received through the reception unit 170 to be output throughthe output unit 110.

Specifically, the signal processing unit 180 may perform an operation such as decoding, scaling, and frame-rate conversion on a video signal included in the contents, and convert the processed video signal into a video signal in a form which can be output by the display unit 111. Further, the signal processing unit 180 may perform signal processing on an audio signal included in the contents, and convert the processed audio signal into an audio signal in a form which can be output by the audio output unit 112.

FIG. 4 is a block diagram illustrating a configuration of a serveraccording to an exemplary embodiment. Referring to FIG. 4, the server200 includes a storage unit 210, a communication unit 220, and a controlunit 230.

The storage unit 210 stores conversation patterns for each of the service domains. Here, the service domains may be categorized into “broadcasting”, “VOD”, “application management”, “apparatus management”, “information” (weather, stock, news, and the like), and so on, according to a subject in which the voice uttered by the user is included. However, this is only an example, and the service domains may be classified according to various other subjects.

More specifically, the storage unit 210 may comprise a corpus database for each service domain. Herein, the corpus database may be implemented to store exemplary sentences and the corresponding responses. That is, the storage unit 210 may store a plurality of exemplary sentences and responses to each sentence for each service domain. In addition, the storage unit 210 may tag, to each sentence, information to interpret the exemplary sentence and an expected response to the sentence, and store the same.

For example, the storage unit 210 may include a first corpus databasefor a broadcasting service domain and a second corpus database for aweather service domain.

In this case, the first corpus database may store various conversationpatterns which can be generated in the broadcasting service domain.

For example, suppose a case where the exemplary sentence of “when doesthe program start?” is stored in a broadcasting service domain.

In this case, the storage unit 210 may tag information to interpret a sentence such as “when does the program start?” to the corresponding sentence and store the same.

Specifically, the storage unit 210 may tag, to the corresponding sentence, information that “program” means a broadcasting program, “when . . . start” is to ask about a broadcasting time, and “when . . . ?” means it is an interrogative sentence, and store the same.

In addition, the storage unit 210 may tag a response to “when does the program start?” to the corresponding sentence and store the same. Specifically, the storage unit 210 may tag “which program do you want to know?” as a response and store the same.

However, this is only an example, and the storage unit 210 may store thesentence, “when does OOO (name of a broadcasting program) start?”, andtag information to interpret the sentence and a response to thecorresponding sentence and store the same.

Specifically, with respect to the sentence such as “when does OOO (nameof a broadcasting program) start?”, the storage unit 210 may taginformation that “OOO (name of a broadcasting program)” means abroadcasting program, “when . . . start” is to ask about a broadcastingtime, and “when . . . ?” means it is an interrogative sentence to thecorresponding sentence and store the same. In addition, the storage unit210 may tag information that a word related to a broadcasting programappears in a sentence such as “when . . . ?” to the correspondingsentence and store the same. Herein, the word related to a broadcastingprogram may include the name of a broadcasting program, an actor, and aproducer.

In addition, the storage unit 210 may tag a response to “when does OOO(name of a broadcasting program) start?” to the corresponding sentenceand store the same. Specifically, the storage unit 210 may tag “thebroadcasting time of <the name of the broadcasting program> you asked is<broadcasting time>” as a response to “when does OOO (name of abroadcasting program) start?” and store the same.

As such, the storage unit 210 may store various conversation patterns ina broadcasting service domain.

The second corpus database may store the conversation pattern which canoccur in the weather service domain.

For example, suppose a case where “what is the weather like in OOO (nameof an area)?” is stored in a weather service domain.

In this case, the storage unit 210 may tag information to interpret thesentence such as “what is the weather like in OOO (name of an area)?” tothe corresponding sentence and store the same. Specifically, the storageunit 210 may tag information that “OOO (name of an area)” means the nameof an area, “what is the weather like . . . ” is to ask about weather,and “what . . . ?” means that it is an interrogative sentence to thecorresponding sentence and store the same.

In addition, the storage unit 210 may tag a response to “what is theweather like in OOO (name of an area)?” to the corresponding sentenceand store the same. Specifically, the storage unit 210 may tag “Do youwant to know the temperature?” as a response to “what is the weatherlike in OOO (name of an area)?” and store the same.

However, this is only an example, and the storage unit 210 may store thesentence of “what is the temperature of OOO (name of an area)?”, and maytag information to interpret the corresponding sentence and the responseof “the temperature of OOO (name of an area) is <temperature>” to thecorresponding sentence and store the same.

As such, the storage unit 210 may store various conversation patterns ina weather service domain.

In the above exemplary embodiment, exemplary sentences and thecorresponding responses stored in the storage unit are described.However, this is only an example, and various exemplary sentences andcorresponding responses may be stored in each service domain.
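By way of a non-limiting illustration, the corpus database described above may be pictured as one list of tagged entries per service domain. The following Python sketch is only one possible layout; the names `CorpusEntry` and `CORPUS` and the tag keys are hypothetical and do not appear in the exemplary embodiments.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class CorpusEntry:
    sentence: str            # exemplary sentence stored for the domain
    tags: Dict[str, str]     # information tagged to interpret the sentence
    response: str            # response tagged to the sentence


# Hypothetical layout: one list of entries per service domain.
CORPUS: Dict[str, List[CorpusEntry]] = {
    "broadcasting": [
        CorpusEntry(
            sentence="when does the program start?",
            tags={
                "program": "broadcasting program",
                "when ... start": "ask about a broadcasting time",
                "when ... ?": "interrogative sentence",
            },
            response="which program do you want to know?",
        ),
    ],
    "weather": [
        CorpusEntry(
            sentence="what is the temperature of OOO?",
            tags={"OOO": "name of an area"},
            response="the temperature of OOO is <temperature>",
        ),
    ],
}
```

Keeping the interpretation tags and the response on the same entry mirrors the description that both are tagged to the corresponding sentence.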

The communication unit 220 receives the user's voice from the display apparatus 100. The communication unit 220 may include an NFC module (not shown) configured to perform communication with the display apparatus 100 and various communication modules such as a wireless communication module (not shown). In addition, the communication unit 220 may perform communication with the display apparatus 100 using a wired/wireless LAN, and the like.

The communication unit 220 may perform communication with the web serverthrough the internet network, transmit various search keywords to theweb server, and receive a web searching result corresponding thereto.Here, the search keywords may include various web-searchable keywordssuch as a weather-related keyword (for example, an area name,temperature, chance of rain, and the like), and a content-relatedkeyword (for example, a title of a movie, a release day of a movie, atitle of music, a singer, and the like) and may be pre-stored in thestorage unit 210.

The control unit 230 controls an overall operation of the server 200. Inparticular, the control unit 230 determines a service domain includingthe user's voice, generates the response information corresponding tothe user's voice based on the conversation pattern of the determinedservice domain, and controls the communication unit 220 to transmit theresponse information to the display apparatus 100.

To this end, the control unit 230 converts the received user's voice into a text using a Speech to Text (STT) algorithm. The control unit 230 may detect, in each corpus database, the user's utterance matching the converted text, and determine the service domain including the user's voice.

Specifically, the control unit 230 may compare a text with sentencesstored in each service domain, and determine a service domain where asentence matching with a received text belongs as a service domainincluding a user's voice.
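As a minimal sketch of this comparison, the following Python fragment scores the converted text against the sentences stored for each service domain and returns the best-matching domain. The use of `difflib.SequenceMatcher`, the 0.6 threshold, and the names `DOMAIN_SENTENCES` and `determine_domain` are assumptions made for illustration; the exemplary embodiments do not specify a particular matching technique.

```python
import difflib
from typing import Optional

# Hypothetical stand-in for the sentences held in each corpus database.
DOMAIN_SENTENCES = {
    "broadcasting": ["when does the program start?", "when does OOO start?"],
    "weather": ["what is the weather like in OOO?",
                "what is the temperature of OOO?"],
}


def determine_domain(text: str, threshold: float = 0.6) -> Optional[str]:
    """Return the domain whose stored sentence best matches `text`, or None."""
    best_domain, best_score = None, 0.0
    for domain, sentences in DOMAIN_SENTENCES.items():
        for sentence in sentences:
            # Similarity ratio in [0, 1]; 1.0 means identical strings.
            score = difflib.SequenceMatcher(
                None, text.lower(), sentence.lower()).ratio()
            if score > best_score:
                best_domain, best_score = domain, score
    # Report no domain when nothing is similar enough (assumed threshold).
    return best_domain if best_score >= threshold else None
```

For example, `determine_domain("when does the program start?")` returns `"broadcasting"`, while a text unrelated to every stored sentence yields `None`.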

Then, the control unit 230 extracts a dialogue act, a main action, and a component slot from the user's voice based on the service domain including the user's voice. For example, the control unit 230 may extract the dialogue act and the main action using a Maximum Entropy Classifier (MaxEnt) and the component slot using a Conditional Random Field (CRF), from the user's voice. However, the extraction operation for the user's voice is not limited thereto, and the dialogue act, the main action, and the component slot may be extracted through various known methods. For example, the control unit 230 may extract a dialogue act, a main action, and a component slot from a user voice using information tagged to a sentence matched with the user voice.

Here, the dialogue act indicates whether the corresponding sentence is a statement, a request, a WH-question, or a YN-question. The main action is semantic information representing the action desired by the corresponding utterance through the conversation in a specific domain. For example, the main action may include TV on/off, seeking a program, finding a programming time, program scheduling, and the like. The component slot is object information for the specific domain represented in the utterance, that is, additional information for specifying the means of an action intended in the specific domain. For example, the component slot may include a genre, a title of a program, a starting time, a name of a channel, a name of an actor, and the like.

The control unit 230 may determine utterance intention included in theuser's voice using the extracted dialogue act, main action, andcomponent slot, generate response information corresponding to thedetermined utterance intention, and transmit the response information tothe display apparatus 100.
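For illustration only, the three extracted elements may be grouped into a single record representing the utterance intention. The class name `UtteranceIntention`, its field names, and the label strings below are hypothetical; the sketch does not perform the MaxEnt or CRF extraction itself and simply assumes its results are available.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class UtteranceIntention:
    dialogue_act: str                 # e.g. statement, request, WH-question
    main_action: Optional[str]        # None when it cannot be extracted
    component_slots: Dict[str, str] = field(default_factory=dict)

    def describe(self) -> str:
        """Render the intention as a short, human-readable string."""
        slots = ", ".join(
            f"{name}={value}"
            for name, value in sorted(self.component_slots.items()))
        return f"{self.dialogue_act}: {self.main_action} ({slots})"


# Assumed extraction result for "when does OOO start?"
intention = UtteranceIntention(
    dialogue_act="WH-question",
    main_action="find_program_time",
    component_slots={"title": "OOO"},
)
```

Allowing `main_action` to be absent and `component_slots` to be empty leaves room for utterances that can only be interpreted with reference to an earlier utterance.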

Here, the response information includes response message informationcorresponding to the user's voice. The response message information maybe information in which the response message output from the displayapparatus 100 is represented in a text form. The display apparatus 100may output the response message corresponding to the user's voice basedon the response message information received from the server 200.

Specifically, the control unit 230 may extract the response for thedetermined utterance intention from the corpus database, convert theextracted response into a text, and generate the response messageinformation.

For example, it is assumed that the user's voice “When is the time to start ∘∘∘ (the name of the program)” is received from the display apparatus 100. In this case, the control unit 230 may search for the corpus database containing the user's utterance that matches the user's voice converted into a text form, and determine that the voice “When is the time to start ∘∘∘ (the name of the program)” is included in the broadcasting service domain.

Then, the control unit 230 determines that the sentence of thecorresponding voice is a question through the dialogue act, and that theuser wants to know “the starting time of the program for ∘∘∘” throughthe main action and the component slot. As a result, the control unit230 may determine that the utterance intention included in the receiveduser's voice is to “inquire a starting time of the program for ∘∘∘ (thename of the program)”.

Then, the control unit 230 extracts the response “the inquired starting time of ∘∘∘ is xxx” from the corpus database of the broadcasting service domain with respect to the utterance intention for “inquiring the starting time of the program for ∘∘∘”. That is, the control unit 230 may search the corpus database of the broadcasting service domain for a response matching “When is the time to start ∘∘∘ (the name of the program)?”, and extract “the starting time of the program for ∘∘∘ is . . . ” as a response. In this case, the control unit 230 may search for the broadcasting starting time of “∘∘∘” using electronic program guide (EPG) information, generate response message information “The inquired starting time of ∘∘∘ is 7:00 p.m. on Saturday”, and transmit the response message information to the display apparatus 100.

In addition, the control unit 230 may determine utterance intention of auser by using information tagged to a sentence matched with a receivedtext.

For example, suppose a case where the text of “when does the program for∘∘∘ (the name of the program) start?” is received from the displayapparatus 100.

In this case, the control unit 230 may determine that the received textbelongs to a broadcasting service domain and extract a dialogue act, amain action, and a component slot from the user voice using informationtagged to “when does the program for ∘∘∘ (the name of the program)start?” which is the sentence matched with the received text in thebroadcasting service domain so as to find out the utterance intention ofthe user voice.

That is, as information to interpret the sentence of “when does the program for ∘∘∘ (the name of the program) start?”, the information that “∘∘∘ (the name of the program)” means a broadcasting program, “when . . . start” is to inquire about the broadcasting time, and “when . . . ?” means it is an interrogative sentence is tagged. Accordingly, based on the information, the control unit 230 may determine that the dialogue act of the received text of “when does the program for ∘∘∘ (the name of the program) start?” is an interrogative sentence, the main action is inquiring about the broadcasting time, and the component slot is ∘∘∘ (the name of the program). Accordingly, the control unit 230 may determine that the utterance intention of the user voice is to “inquire” about “the broadcasting time” of “∘∘∘ (the name of the program)”. In addition, in response to the utterance intention of “inquiring” about “the starting time of the program” of “∘∘∘ (the name of the program)”, the control unit 230 may extract “the starting time of ∘∘∘ is <broadcasting time>” from the corpus database of the broadcasting service domain.

In this case, the control unit 230 may generate a sentence in a completeform by completing a blank included in a searched response.

For example, the control unit 230 may complete the response of “the broadcasting time of <name of a broadcasting program> is <broadcasting time>” by writing “∘∘∘ (the name of the program)” in the blank for the name of the broadcasting program. In addition, the control unit 230 may search for the broadcasting time of “∘∘∘ (the name of the program)” using EPG (Electronic Program Guide) information and write the searched broadcasting time in the other blank of <broadcasting time>. Accordingly, the control unit 230 may generate response message information corresponding to the user voice using the complete sentence of “the broadcasting time of ∘∘∘ (the name of the program) is 7 o'clock on Saturday”, and transmit the generated response message information to the display apparatus 100.

Accordingly, the display apparatus 100 may output “the broadcasting timeof ∘∘∘ (the name of the program) is 7 o'clock on Saturday” in either avoice or a text form based on the response message information receivedfrom the server 200.
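The completion of the blanks may be sketched, under stated assumptions, as a simple template substitution. In the following Python fragment the dictionary `EPG` is a hypothetical stand-in for a lookup in electronic program guide information, and `complete_response` is an illustrative helper rather than a function described in the exemplary embodiments.

```python
import re
from typing import Dict

# Hypothetical stand-in for a lookup in EPG information.
EPG = {"OOO": "7 o'clock on Saturday"}


def complete_response(template: str, values: Dict[str, str]) -> str:
    """Fill each <blank> in `template`; unknown blanks are left untouched."""
    def fill(match):
        name = match.group(1)
        return values.get(name, match.group(0))
    return re.sub(r"<([^<>]+)>", fill, template)


template = ("the broadcasting time of <name of the broadcasting program> "
            "is <broadcasting time>")
message = complete_response(template, {
    "name of the broadcasting program": "OOO",
    "broadcasting time": EPG["OOO"],
})
# message == "the broadcasting time of OOO is 7 o'clock on Saturday"
```

Leaving an unknown blank in place, rather than raising an error, is a design choice of this sketch so that a partially completed response remains visible for inspection.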

Further, the response information may further comprise a control commandfor controlling the function of the display apparatus 100. That is, thecontrol unit 230 may generate the control command for causing thedisplay apparatus 100 to perform the function corresponding to theutterance intention of the user.

To this end, the storage unit 210 may have pre-stored the control command corresponding to the user's utterance intention. For example, when the user's utterance intention is to change the channel, the storage unit 210 may match and store the control command for changing the channel of the display apparatus 100. When the user's utterance intention is to schedule a recording, the storage unit 210 may match and store the control command for executing the scheduled recording for a specific program in the display apparatus 100.

It is assumed that the user's voice “schedule ∘∘∘ (the name of theprogram)” is received from the display apparatus 100. In this case, thecontrol unit 230 may search the corpus database including the user'sutterance matching with the user's voice which is converted in a textform, and determine that the user's voice “schedule ∘∘∘” is included inthe broadcasting service domain.

The control unit 230 determines that the corresponding voice is asentence related to a “Request” through the dialogue act and that theuser wants to “schedule the program for ∘∘∘ (the name of the program)”through the main action and the component slot. As a result, the controlunit 230 may determine that the utterance intention included in thereceived user's voice is “Request the program scheduling for ∘∘∘ (thename of the program)”.

Then, the control unit 230 may detect the control command corresponding to the utterance intention “Request the program scheduling for ∘∘∘ (the name of the program)” in the storage unit 210, and generate the control command for executing the scheduled recording function of “∘∘∘ (the name of the program)” in the display apparatus 100. In this case, the control unit 230 may extract the response message information “The scheduled recording for the requested program is completed” for the utterance intention “Request the program scheduling for ∘∘∘ (the name of the program)” from the corpus database of the broadcasting service domain, and transmit the response message information to the display apparatus 100.
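As a hedged illustration of matching a control command to an utterance intention, the following Python sketch looks up a pre-stored command and attaches it to the response information together with the response message. The command identifiers, the action labels, and the dictionary layout are all hypothetical.

```python
from typing import Any, Dict

# Hypothetical table of control commands pre-stored per main action.
CONTROL_COMMANDS = {
    "change_channel": "CMD_CHANGE_CHANNEL",
    "schedule_recording": "CMD_SCHEDULE_RECORDING",
}


def build_response_info(main_action: str,
                        slots: Dict[str, str],
                        message: str) -> Dict[str, Any]:
    """Combine the response message with a control command, if one exists."""
    info: Dict[str, Any] = {"message": message}
    command = CONTROL_COMMANDS.get(main_action)
    if command is not None:
        # Slots such as the program title become the command's arguments.
        info["control_command"] = {"name": command, "args": dict(slots)}
    return info


info = build_response_info(
    "schedule_recording",
    {"title": "OOO"},
    "The scheduled recording for the requested program is completed",
)
```

An intention with no matching entry, such as an inquiry about a starting time, simply produces response information that carries a message only.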

When the control unit 230 cannot determine the user's utterance intention included in the currently received user's voice, the control unit 230 may determine the user's utterance intention included in the currently received user's voice with reference to the previously received user's voice. That is, the control unit 230 may compare the currently received user's voice with the conversation pattern stored in the corpus database to determine whether or not the currently received user's voice corresponds to the first user's utterance in the conversation pattern, and may determine the user's utterance intention included in the currently received user's voice with reference to the previously received user's voice when the currently received user's voice does not correspond to the first user's utterance.

For example, it is assumed that the user's voice “When is the time to start ∘∘∘ (the name of the program)?” is input and then the user's voice “The casts?” is input. In this case, when the control unit 230 determines that the user's voice “The casts?” does not correspond to the first user's utterance in the broadcasting service domain, the control unit 230 determines the utterance intention of “The casts?” based on the previously received user's voice “When is the time to start ∘∘∘?”.

That is, to determine the utterance intention of the user's voice “The casts?”, from which the component slot cannot be extracted, the control unit 230 may determine that the utterance intention of “The casts?” is “to inquire the cast for ∘∘∘ (the name of the program)” using “∘∘∘ (the name of the program)” included in the previously received user's voice.

As another example, it is assumed that the user's voice “□□□ (the name of the program)?” is input. In this case, to determine the utterance intention of “□□□ (the name of the program)?”, from which the main action cannot be extracted, the control unit 230 may determine that the utterance intention included in “□□□ (the name of the program)?” is “to inquire the starting time of the program □□□ (the name of the program)” using “When” included in the previously received user's voice.
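The two examples above may be sketched as filling whatever the current utterance lacks from the previous one. The dictionary keys, the action labels, and the function name `resolve_with_context` in the following Python fragment are assumptions made for illustration.

```python
from typing import Any, Dict


def resolve_with_context(current: Dict[str, Any],
                         previous: Dict[str, Any]) -> Dict[str, Any]:
    """Fill a missing main action or missing slots from the previous utterance."""
    return {
        # Keep the current main action when present, else reuse the previous.
        "main_action": current.get("main_action") or previous.get("main_action"),
        # Current slots override previous slots that share the same name.
        "slots": {**previous.get("slots", {}), **current.get("slots", {})},
    }


# Previous utterance: "When is the time to start OOO?"
previous = {"main_action": "inquire_start_time", "slots": {"title": "OOO"}}

# "The casts?": a main action but no component slot.
casts = resolve_with_context({"main_action": "inquire_cast", "slots": {}}, previous)

# "□□□?": a component slot but no main action.
other = resolve_with_context({"main_action": None, "slots": {"title": "□□□"}}, previous)
```

Here `casts` resolves to inquiring the cast of OOO, and `other` resolves to inquiring the starting time of □□□.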

When it is determined that the currently received user's voice isincluded in another service domain which is different from the servicedomain including the previously received user's voice, the control unit230 generates the response information corresponding to the currentlyreceived user's voice based on a conversation pattern of the otherservice domain.

Specifically, even before all the slots constituting the service domain are filled with the previously received user's voice, the control unit 230 may generate the response information corresponding to the currently received user's voice based on the conversation pattern of the other service domain when it is determined that the currently received user's voice is included in the other service domain.

Here, the slot denotes a blank filled with the component slot extractedfrom the user's voice in each service domain. For example, in the caseof the broadcasting service domain, the slot denotes the blank filledwith the component slot such as a genre, a title of a program, astarting time, a name of a channel, or a name of an actor.

An exemplary embodiment determines the service domain including the received user's voice even when not all of the empty slots constituting the service domain are filled, and determines the utterance intention of the corresponding voice even when the user's voice is included in another service domain which is different from the service domain including the previously received user's voice.

For this reason, the control unit 230 determines whether or not thecurrently received user's voice matches with the corpus database of theservice domain including the previously received user's voice.

When it is determined that currently received user's voice matches withthe corpus database, the control unit 230 determines the utteranceintention for the currently received user's voice using thecorresponding service domain, and generates the response informationcorresponding to the utterance intention. The operation has beendescribed above and thus overlapping description thereof will beomitted.

When it is determined that currently received user's voice does notmatch with the corpus database, the control unit 230 compares thecurrently received user's voice with another corpus database, anddetermines the service domain including the currently received user'svoice.

Next, the control unit 230 may determine the utterance intention included in the currently received user's voice, generate the response information corresponding to the determined utterance intention based on the conversation pattern of the other service domain, and control the communication unit 220 to transmit the response information to the display apparatus 100.

That is, the control unit 230 extracts the dialogue act, the main action, and the component slot from the user's voice based on the service domain including the currently received user's voice. Then, the control unit 230 may determine the utterance intention included in the currently received user's voice using the extracted dialogue act, main action, and component slot, generate the response information corresponding to the determined utterance intention, and transmit the response information to the display apparatus 100. This will be described in detail with reference to FIG. 5.
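The order of comparison described above, namely the corpus database of the previous service domain first and the other corpus databases next, may be sketched as follows. The exact-match helper `matches`, the dictionary `CORPUS`, and the function `select_domain` are hypothetical simplifications; in particular, the sketch does not check whether the slots of the previous domain are filled, which reflects that the domain is determined regardless of unfilled slots.

```python
from typing import Dict, List, Optional

# Hypothetical stand-in for the sentences held in each corpus database.
CORPUS: Dict[str, List[str]] = {
    "broadcasting": ["when is the time to start the program?"],
    "weather": ["what is the temperature in seoul?"],
}


def matches(text: str, sentences: List[str]) -> bool:
    """Simplified matching: case-insensitive equality with a stored sentence."""
    return text.lower() in (s.lower() for s in sentences)


def select_domain(text: str, previous_domain: Optional[str]) -> Optional[str]:
    """Try the previous domain first, then every other corpus database."""
    if previous_domain is not None and matches(text, CORPUS.get(previous_domain, [])):
        return previous_domain
    for domain, sentences in CORPUS.items():
        if domain != previous_domain and matches(text, sentences):
            return domain  # switch domains even if earlier slots are unfilled
    return None


domain = select_domain("What is the temperature in Seoul?", "broadcasting")
```

With the previous utterance in the broadcasting service domain, the weather question above is assigned to the weather service domain.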

FIG. 5 is a view illustrating an operation of an interactive systemaccording to an exemplary embodiment.

As shown in FIG. 5A, the display apparatus 300 interacts with a server(not shown), and may output the response message “What program do youwant to know?” in a voice form when the user's voiced expression, “Whenis the time to start the program?”, is received.

Then, as shown in FIG. 5B, it is assumed that the user's voice “When is the time to start the program?” included in the broadcasting service domain and the user's voice “What is the temperature in Seoul?” included in another service domain are received. In this case, the server determines the service domain including the user's voiced expression, “What is the temperature in Seoul?”, even when not all of the slots constituting the broadcasting service domain are filled. Thus, when it is determined that the corresponding voice is included in the weather service domain, the server extracts the dialogue act, the main action, and the component slot from “What is the temperature in Seoul?” based on the weather service domain, and determines that the utterance intention included in the currently received user's voice is “to inquire temperature information of Seoul”.

Then, the server extracts the response “The inquired current temperature of Seoul is xxx” for the utterance intention to “inquire the temperature information of Seoul” from the corpus database of the weather service domain. In this case, the server may extract “Seoul” and “Temperature” as keywords from the corresponding voice and then transmit the keywords to a web server, and receive the searching result for the temperature of Seoul from the web server and then transmit the response message information “The current temperature of Seoul is 23° C.” to the display apparatus 300. Therefore, the display apparatus 300 may output, in a voice form, “The current temperature of Seoul is 23° C.” as the response to “What is the temperature in Seoul?” included in the other service domain which is different from the domain including the previously received user's voice.

The above-described exemplary embodiment has illustrated that the server200 is a single server, but this is merely illustrative.

That is, the server 200 may include a first server (not shown)configured to receive the user's voice from the display apparatus 100,convert the received user's voice into a text, and transmit theconverted text to the display apparatus 100, and a second server (notshown) configured to receive the converted text from the displayapparatus 100, generate the response information corresponding to theuser's voice, and transmit the response information to the displayapparatus 100.

Specifically, the first server may convert the received user's voicefrom the display apparatus 100 into the text using an STT algorithm, andtransmit the converted text to the display apparatus 100. The secondserver may determine the user's utterance intention based on thereceived text, generate the response information corresponding to theuser's utterance intention, and transmit the response information to thedisplay apparatus 100.

FIG. 6 is a flowchart illustrating a method of controlling a serverwhich interacts with a display apparatus according to an exemplaryembodiment.

First, the server receives a user's voice from the display apparatus(S410).

Next, the server determines a service domain including the user's voiceusing conversation patterns for pre-stored service domains (S420).

Subsequently, when it is determined that a currently received user'svoice is included in another service domain which is different from adomain including a previously received user's voice (S430-Y), the servergenerates response information corresponding to the currently receiveduser's voice based on the conversation pattern of the other servicedomain (S440).

Specifically, when it is determined that the currently received user'svoice is included in another service domain even before all the slotsconstituting the current service domain are filled, the server maygenerate the response information corresponding to the currentlyreceived user's voice based on the conversation pattern of the otherservice domain using the previously received user's voice. Here, theserver may determine the utterance intention included in the user'svoice, and generate the response information corresponding to thedetermined utterance intention based on the conversation pattern of theother service domain.

When the currently received user's voice is included in the same servicedomain as the service domain including the previously received user'svoice (S430-N), the server generates the response informationcorresponding to the currently received user's voice based on theconversation pattern of the service domain including the previouslyreceived user's voice (S450).

Next, the server transmits the generated response information to thedisplay apparatus (S460).

The response information may include response message informationcorresponding to the user's voice and the display apparatus may output aresponse message corresponding to the user's voice based on the responsemessage information.

Further, the response information may further include a control commandfor controlling a function of the display apparatus.

These exemplary embodiments have been described in detail above, and thus overlapping description thereof will be omitted.

A non-transitory computer-readable medium in which a program for sequentially executing the control method according to the exemplary embodiment is stored may be provided.

The non-transitory computer-readable medium is not a medium configured to temporarily store data such as a register, a cache, or a memory, but a medium configured to semi-permanently store data and be readable by an apparatus. Specifically, the above-described various applications or programs may be stored in and provided through a non-transitory computer-readable medium such as a compact disc (CD), a digital versatile disc (DVD), a hard disc, a Blu-ray disc, a universal serial bus (USB) memory, a memory card, or a read only memory (ROM).

A bus is not illustrated in the block diagram illustrating the displayapparatus and the server, but the communication between the respectivecomponents in the display apparatus and server may be performed throughthe bus. Further, a processor performing the above-described varioussteps such as a central processing unit (CPU) or a microprocessor may befurther included in each device.

The foregoing exemplary embodiments and advantages are merely exemplaryand are not to be construed as limiting the present inventive concept.The exemplary embodiments can be readily applied to other types ofapparatuses. Also, the description of the exemplary embodiments isintended to be illustrative, and not to limit the scope of the claims,and many alternatives, modifications, and variations will be apparent tothose skilled in the art.

What is claimed is:
 1. A server which interacts with a displayapparatus, the server comprising: a storage which stores conversationpatterns for service domains; a communicator configured to receive avoice expression from the display apparatus; and a controller configuredto determine a service domain related to the voice expression, generateresponse information corresponding to the voice expression based on aconversation pattern of the determined service domain, and to controlthe communicator to transmit the response information to the displayapparatus, wherein when it is determined that a currently received voiceexpression is included in another service domain which is different froma service domain including a previously received voice expression, thecontroller generates the response information corresponding to thecurrently received voice expression based on a conversation pattern ofthe other service domain.
 2. The server as claimed in claim 1, wherein when it is determined that the currently received voice expression is included in the other service domain even before slots constituting the service domain are filled with the previously received voice expression, the controller generates the response information corresponding to the currently received voice expression based on the conversation pattern of the other service domain.
 3. The server as claimed in claim 2, wherein the controller determines an utterance intention included in the currently received voice expression, generates the response information corresponding to the determined utterance intention based on the conversation pattern of the other service domain, and controls the communicator to transmit the response information to the display apparatus.
 4. The server as claimed in claim 1, wherein the response information comprises response message information corresponding to the voice, and the display apparatus outputs the response message corresponding to the voice based on the response message information.
 5. The server as claimed in claim 4, wherein the response information further comprises a control command for controlling a function of the display apparatus.
 6. A method of controlling a server which interacts with a display apparatus, the method comprising: receiving a voice expression from the display apparatus; determining a service domain including the voice, using conversation patterns; generating response information corresponding to a currently received voice expression based on a conversation pattern of another service domain when the currently received voice expression is included in the other service domain which is different from a domain having a previously received voice expression; and transmitting the generated response information to the display apparatus.
 7. The method as claimed in claim 6, wherein the generating comprises generating the response information corresponding to the currently received voice based on the conversation pattern of the other service domain when it is determined that the currently received voice expression is included in the other service domain before slots constituting the service domain are filled with the previously received voice expression.
 8. The method as claimed in claim 7, wherein the generating includes determining an utterance intention included in the voice expression and generating the response information corresponding to the determined utterance intention based on the conversation pattern of the other service domain.
 9. The method as claimed in claim 6, wherein the response information includes response message information corresponding to the voice expression, and the display apparatus outputs a response message corresponding to the voice expression based on the response message information.
 10. The method as claimed in claim 9, wherein the response information further includes a control command for controlling a function of the display apparatus.
 11. The server as claimed in claim 1, wherein the voice expression is a voice expression of a user.
 12. The method as claimed in claim 6, wherein the voice expression is a voice expression of a user.
 13. A server which interacts with a display apparatus, the server comprising: a storage which stores conversation patterns for service domains; a communicator configured to receive a voice expression from the display apparatus; and a controller configured to determine a service domain among a plurality of service domains, each of said plurality of service domains being related to a corresponding conversation pattern, generate response information corresponding to the voice expression based on a conversation pattern of the determined service domain, and to control the communicator to transmit the response information to the display apparatus.
 14. The server as claimed in claim 13, wherein the controller determines an utterance intention included in the received voice expression, generates the response information corresponding to the determined utterance intention based on the conversation pattern, and controls the communicator to transmit the response information to the display apparatus.
 15. A method of controlling a server which interacts with a display apparatus, the method comprising: receiving a voice expression from the display apparatus; determining a service domain related to the voice expression among a plurality of service domains, using conversation patterns, each of the plurality of service domains being related to corresponding conversation patterns; generating response information corresponding to the voice expression based on a conversation pattern of the determined service domain; and transmitting the generated response information to the display apparatus.
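
For illustration only, the domain-switching behavior recited in claims 1 and 2 might be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the keyword-based domain detection, the domain names, the slot names, and every identifier (PATTERNS, Session, determine_domain, respond) are hypothetical, the voice expression is assumed to have already been converted to text, and slot extraction is omitted.

```python
from dataclasses import dataclass, field
from typing import Optional

# Hypothetical conversation patterns, one per service domain. Each pattern
# lists keywords that signal the domain and the slots the domain needs.
PATTERNS = {
    "broadcast": {"keywords": {"channel", "program", "watch"},
                  "slots": ["channel", "time"]},
    "weather": {"keywords": {"weather", "temperature", "rain"},
                "slots": ["city"]},
}


@dataclass
class Session:
    """Dialogue state kept by the server between voice expressions."""
    domain: Optional[str] = None
    slots: dict = field(default_factory=dict)


def determine_domain(text: str, previous: Optional[str]) -> Optional[str]:
    """Pick the first domain whose keywords appear in the text.

    If no keyword matches, assume the user is continuing in the
    previous domain (which may be None at the start of a session).
    """
    words = set(text.lower().split())
    for name, pattern in PATTERNS.items():
        if words & pattern["keywords"]:
            return name
    return previous


def respond(session: Session, text: str) -> dict:
    """Generate response information for one voice expression (as text)."""
    domain = determine_domain(text, session.domain)
    if domain is None:
        return {"domain": None, "message": "Sorry, I did not understand."}
    if domain != session.domain:
        # Switch to the other domain even if the previous domain's slots
        # were never filled, and start that domain with empty slots.
        session.domain = domain
        session.slots = {}
    missing = [s for s in PATTERNS[domain]["slots"] if s not in session.slots]
    if missing:
        message = f"[{domain}] Please tell me the {missing[0]}."
    else:
        message = f"[{domain}] Request complete."
    return {"domain": domain, "message": message}


session = Session()
print(respond(session, "I want to watch a program"))
print(respond(session, "what is the weather"))
```

In this sketch, the second call is answered from the hypothetical "weather" pattern even though the "broadcast" slots were never filled, which is the case addressed by claim 2.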